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for the invention entitled: 

"Novel molecules" 
The invention is described in the following statement: 
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NOVEL MOLECULES 

The present invention related generally to novel molecules and more particularly novel 
5 proteinaceous molecules involved in or associated with regulation of cell activities and/or 
viability. The present invention is particularly directed to novel serine proteinases and a 
novel kinase and to derivatives, agonists and antagonists thereof. 

Sequence Identity Numbers (SEQ ID NOs.) for the nucleotide and amino acid sequences 
10 referred to in the specification are defined at the end of the subject specification. 

Throughout this specification and the claims which follow, unless the context requires 
otherwise, the word "comprise", or variations such as "comprises" or "comprising", will be 
understood to imply the inclusion of a stated integer or group of integers but not the exclusion 
15 of any other integer or group of integers. 

The rapidly increasing sophistication of recombinant DNA technology is greatly facilitating 
research and development in the medical and allied health fields. This is particularly the case 
in the area of cell regulation leading to a greater understanding of the events leading to or 
20 involved in cancer, development of acquired immunodeficiency disease syndrome (AIDS), 
neurological disorders, heart disease, tissue graft rejection amongst many other conditions. 

Two particularly important classes of molecules are the proteinases and kinases. 

25 Proteinases play important roles in a nimiber of physiological and pathological processes such 
as proteolytic cascades involved in blood coagulation, fibrinolysis and complement activation 
as well as cleavage of growth factors, hormones and receptors, the release of bioactive 
molecules and processes involving cell proliferation and development, inflammation, tumour 
growth and metastasis. Of particular significance are the cellular proteinases, or those 

30 proteinases synthesized in cells and tissues which serve to activate or deactivate proteins 
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responsible for performing specific functions. These proteinases may be found outside the 
cell, within the cell or may be present on the cell surface. 

Serine proteinases are particularly important. These proteinases are characterised by a 
5 mechanism involving serine, histidine and aspartate amino acids in the serine proteinase active 
site. Members of the serine proteinase family which play important roles in a range of 
cellular functions and which have demonstrated causative roles in human diseases include 
tissue-type plasminogen activator and thrombm (thrombosis and blood clotting), urokinase- 
type plasminogen activator (cancer and metastasis), trypsin and elastase (emphysema and liver 
10 disease) and angiotensin converting enzyme (hypertension). 

A serine proteinase is also implicated in TNFa degradation and soluble TNF-receptor (p75) 
release by THPl cells (Vey et al. Eur. J. Imm. 26, 2404-2409, 1996). Serine protemases 
have been implicated in the activation of macrophages (Nakabo et al. J. Leukocyte BioL 60, 

15 328-336, 1996), in nuclear lamin degradation in apoptosis (McConkey et al. J.Biol. Chem., 
271, 22398-22406, 1996), in prostaglandin-E2 induced release of soluble TNF receptor 
shedding (Choi et al. Cellular Immunology 170, 178-184, 1996), in PAF sysnthesis 
(Bussolino et al. Eur. J. Immunol. 24, 3131-3139, 1994), and in the proteolysis of IkB, a 
regulatory molecule important in signal transduction and apoptosis. Release of serine 

20 proteinases known as granzymes is central to CTL killing and many of the substrates cleaved 
by granzymes are also cleaved by cellular proteinases (for example, IL-lp is a substrate for 
Granzyme B as well as the cysteine proteinase, interleukin Ip-converting enzyme (ICE)). 
Granzyme A, a serine proteinase with Arg-amidolytic activity, has been reported to induce the 
production of IL-6 and IL-8 in lung fibroblasts (Sower et al. Cellular Immunology 171, 159- 

25 163, 1996) and cleaves IL-lp to a 17kD mature form that is biologically active. 

Kinases are a large group of molecules, many of which regulate the response of cells to 
external stimuli. These molecules regulate proliferation and differentiation in eukaryotic cells 
frequently via signal transduction pathways. 
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The identification of new serine proteinases and kinases permits the development of a range 
of derivatives, agonists and antagonists at tlie nucleic acid and protein levels which in turn 
have applications in the treatment and diagnosis of a range of conditions such as cancer, 
inflammation, neurological disorders amongst many other conditions including conditions 
S which initiate or promote apoptosis such as viral infection, old age and drug abuse. 

Accordingly, one aspect of the present invention provides a novel molecule in isolated form 
involved in or associated with regulation of cell activity and/or viability. 

10 More particularly, the present invention contemplates a novel serine proteinase in isolated 
form comprising a sequence of amino acids encoded by a nucleotide sequence, at least a 
portion of which, is capable of being amplified by polymerase chain reaction (PGR) using the 
following primers: 

15 5' ACAGAATTCTGGGTIGTIACIGCIGCICAYTG3' [SEQ ID NO:l]; and 
5' ACACTTAAGAXIGGICCICCIC/GT/AXTCICC3' [SEQ ID NO:2]; 
or a complementary form of said primers. 

20 

In a particularly preferred embodiment, the isolated serine proteinase comprises the amino 
acid sequence substantially set forth in SEQ ID NO:4 or an amino acid sequence having at 
least about 50% similarity to all or part thereof. This serine proteinase is referred to herein 
as -HELA2" and is a short isoform. 

25 

In another preferred embodiment, the amino acid sequence of the serine proteinase is 
substantially as set forth in SEQ ID NO: 6 or an amino acid sequence having at least about 
50% similarity to all or part thereof. This serine proteinase is referred to herein as "HELA^" 
and is a long isoform. 

30 
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Yet another preferred embodiment of the present invention provide an amino acid sequence 
substantially as set forth in SEQ ID NO: 8 or an amino acid sequence having at least about 
50% similarity to all or part thereof. This serine proteinase is referred to herein as "ATC2". 

5 Yet another aspect of the present invention provides a serine proteinase in isolated form 
comprising a sequence of amino acids encoded by a nucleotide sequence substantially as set 
forth in SEQ ID NO: 3 or a nucleotide sequence having at least 50% similarity to all or part 
thereof or a nucleotide sequence capable of hybridising to the sequence set forth in SEQ ID 
NO: 3 under low stringency conditions. 

10 

Yet another aspect of the present invention provides a serine proteinase in isolated form 
comprising a sequence of amino acids encoded by a nucleotide sequence substantially as set 
forth in SEQ ID NO: 5 or a nucleotide sequence having at least 50% similarity to all or part 
thereof or a nucleotide sequence capable of hybridising to the sequence set forth in SEQ ID 
15 NO:5 under low stringency conditions. 

Still another aspect of the present invention provides a serine proteinase in isolated form 
comprising a sequence of amino acids encoded by a nucleotide sequence substantially as set 
forth in SEQ ID NO:7 or a nucleotide sequence having at least 50% similarity to all or part 
20 thereof or a nucleotide sequence capable of hybridising to the sequence set forth in SEQ ID 
NO:7 under low stringency conditions. 

Another embodiment of the present invention is directed to a kinase in isolated form 
comprising an amino acid sequence substantially as set forth in SEQ ID NO: 10 or having 
25 50% amino acid similarity to all or part thereof. This kinase is referred to herein as 
"BCON3". 

In a related embodiment, the kinase comprises an amino acid sequence encoded by a 
nucleotide sequence substantially as set forth in SEQ ID NO: 9 or a nucleotide sequence 
30 having at least 50% similarity to all or part of the nucleotide sequence set forth in SEQ ID 
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NO: 9 or a nucleotide sequence capable of hybridising to the nucleotide sequence set forth in 
SEQ ID N0:9 under low stringency conditions. 

The present invention further provides an isolated nucleic acid molecule comprising a 
5 sequence of nucleotides substantially as set forth in SEQ ID NO:3 or having 50% similarity 
to all or part thereof or a nucleic acid molecule capable of hybridising to SEQ ID NO: 3 under 
low stringency conditions. 

Another aspect of the present invention provides an isolated nucleic acid molecule comprising 
10 a sequence of nucleotides substantially as set forth in SEQ ID NO:5 or having 50% similarity 
to all or part thereof or a nucleic acid molecule capable of hybridising to SEQ ID NO:5 under 
low stringency conditions. 

Another aspect of the present invention is directed to an isolated nucleic acid molecule 
15 comprising a sequence of nucleotides substantially as set forth in SEQ ID NO:7 or having 
50% similarity to all or part thereof or a nucleic acid molecule capable of hybridising to SEQ 
ID NO:7 under low stringency conditions. 

Still another aspect of the present invention provides an isolated nucleic acid molecule 
20 comprising a sequence of nucleotides substantially as set forth in SEQ ID NO:9 or having 
50% similarity to all or part thereof or a nucleic acid molecule capable of hybridising to SEQ 
ID NO:9 under low stringency conditions. 

Reference herein to a low stringency includes low stringency at 42^C includes and encompasses 
25 from at least about 1% v/v to at least about 15% v/v formamide and from at least about IM to 
at least about 2M salt for hybridisation, and at least about IM to at least about 2M salt for 
washing conditions. Altemative stringency conditions may be applied where necessary, such 
as medium stringency, which includes and encompasses from at least about 16% v/v to at least 
about 30% v/v formamide and from at least about 0.5M to at least about 0.9M salt for 
30 hybridisation, and at least about 0.5M to at least about 0.9M salt for washing conditions, or high 
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stringency, which includes and encompasses from at least about 31% v/v to at least about 50% 
v/v formamide and from at least about O.OIM to at least about 0. 15M salt for hybridisation, and 
at least about O.OIM to at least about 0. 15M salt for washing conditions. 

5 Reference herein to similarity to "part" of a sequence means similarity to at least about 5 
contiguous amino acids or at least about 15 contiguous nucleotide bases and more preferably 
at least about 9 contiguous amino acids or at least about 27 contiguous nucleotide bases. 

The term ''similarity" includes exact identity between sequences or, where the sequence 
10 differs, different amino acids are related to each other at the structural, functional, 
biochemical and/or conformational levels. 

The term "isolated" includes biological purification and biological separation and encompasses 
molecules having undergone at least one purification, concentration or separation step relative 
15 to its natural environment. For example, a preparation may comprise at least about 10%, 
preferably at least about 20%, more preferably at least about 30%, still more preferably at 
least about 50% or greater of the molecule relative to at least one other component in a 
composition as determined by activity, mass, amino acid content, nucleotide content or other 
convenient means. 

20 

Hereinafter, the molecules of the present invention are referred to as a "proteinase/kinase". 
The term "proteinase/kinase" includes the serine proteinases HEI-A2 and ATC2 and the kinase 
BCON3. The proteinase/kinase of the present invention may be in isolated, naturally 
occurring form or recombinant or synthetic form or chemical analogues thereof. 

25 

The proteinase/kinase of the present invention is preferably of hiunan origin but from non- 
human origins are also encompassed by the present invention. Non-human animals 
contemplated by the present invention include primates, livestock animals (e.g. sheep, cows, 
pigs, goats, horses, donkeys), laboratory test animals (e.g. mice, rats, guinea pigs, hamsters, 
30 rabbits), domestic companion animals (e.g. dogs, cats), birds (e.g. chickens, geese, ducks and 
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other poultry birds, game birds, emus, ostriches) and captive wild or tamed animals (e.g. foxes, 
kangaroos, dingoes). The present invention also encompasses a proteinase/kinase homologue 
from Xenopus and plants. 

5 The nucleic acid molecules encoding a proteinase/kinase may be genomic DNA, cDNA or RNA 
such as mRNA. 

Another aspect of the present invention contemplates a method for cloning a nucleotide 
sequence encoding a novel serine proteinase, said method comprising screening a nucleic acid 
10 library with said one or more or oligonucleotides defined by SEQ ED N0:1 and/or SEQ ED 
NO: 2 and obtaining a clone therefrom which encodes said novel serine proteinase or part 
thereof. 

Preferably, the nucleic acid library is genomic DNA, cDNA, genomic or mRNA library. 

15 

Preferably, the nucleic acid library is a cDNA expression library. 

Preferably, the nucleic acid library is of human origin such as from brain, liver, kidney, neo- 
natal tissue, embryonic tissue, tumour or cancer tissue. 

20 

Still another embodiment contemplates the promoter or a functional part thereof of the genomic 
gene encoding the subject proteinase/kinase of the present invention. The promoter may readily 
be obtained by, for example, "chromosome walking". 

25 The present invention further contemplates a range of derivatives of the subject 
proteinase/kinase. Derivatives include fragments, parts, portions, mutants, homologues and 
analogues of the subject polypeptides and corresponding genetic sequences. Derivatives also 
include single or multiple amino acid substitutions, deletions and/or additions to the subject 
molecules or single or multiple nucleotide substitutions, deletions and/or additions to the 

30 genetic sequence encoding the molecules. '"Additions" to amino acid sequences or nucleotide 
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sequences include fusions with other peptides, polypeptides or proteins or fusions to nucleotide 
sequences. Reference herein to the serine proteinase and kinase includes reference to all 
derivatives thereof including functional derivatives or immunologically interactive derivatives. 

5 Analogues of the subject serine proteinase and kinase contemplated herein include, but are not 
limited to, modification to side chains, incorporating of unnatural amino acids and/or their 
derivatives during peptide, polypeptide or protein synthesis and the use of crosslinkers and 
other methods which impose conformational constraints on the proteinaceous molecule or their 
analogues. 

10 

Examples of side chain modifications contemplated by the present invention include 
modifications of amino groups such as by reductive alkylation by reaction with an aldehyde 
followed by reduction with NaBH4; amidination with methylacetimidate; acylation with acetic 
anhydride; carbamoylation of amino groups with cyanate; trinitrobenzylation of amino groups 
15 with 2, 4, 6-trinitroben2ene sulphonic acid (TNBS); acylation of amino groups with succinic 
anhydride and tetrahydrophthalic anhydride; and pyridoxylation of lysine with pyridoxal-5- 
phosphate followed by reduction with NaBH4. 

The guanidine group of arginine residues may be modified by the formation of heterocyclic 
20 condensation products with reagents such as 2,3-butanedione, phenylglyoxal and glyoxal. 

The carboxyl group may be modified by carbodiimide activation via O-acylisourea formation 
followed by subsequent derivitisation, for example, to a corresponding amide. 

25 Sulphydryl groups may be modified by methods such as carboxymetfaylation with iodoacetic 
acid or iodoacetamide; performic acid oxidation to cysteic acid; formation of a mixed 
disulphides with other thiol compounds; reaction with maleimide, maleic anhydride or other 
substituted maleimide; formation of mercurial derivatives using 4-chloromercuribeiizoate, 4- 
chloromercuriphenylsulphonic acid, phenylmercury chloride, 2-chloromercuri-4-nitrophenol 

30 and other mercurials; carbamoylation with cyanate at alkaline pH. 
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Tryptophan residues may be modified by, for example, oxidation with N-bromosuccinimide or 
aikylation of the indole ring with 2-hydroxy-5-nitrobenzyl bromide or sulphenyl halides. 
Tyrosine residues on the other hand, may be altered by nitration with tetranitromethane to form 
a 3-nitrotyrosine derivative. 

5 

Modification of the imidazole ring of a histidine residue may be accomplished by aikylation 
with iodoacetic acid derivatives or N-carbethoxylation with diethylpyrocarbonate. 

Examples of incorporating unnatural amino acids and derivatives during peptide synthesis 
10 include, but are not limited to, use of norleucine, 4-amino butyric acid, 4-amino-3-hydroxy-5- 
phenylpentanoic acid, 6-aminohexanoic acid, t-butylglycine, norvaline, phenylglycine, 
ornithine, sarcosine, 4-amino-3-hydroxy-6-methylheptanoic acid, 2-thienyl alanine and/or D- 
isomers of amino acids. A list of unnatural amino acid, contemplated herein is shown in Table 
1. 

15 

Crosslinkers can be used, for example, to stabilise 3D conformations, using homo-bifunctional 
crosslinkers such as the bifunctional imido esters having (CH2)n spacer groups with n=l to 
n=6, glutaraldehyde, N-hydroxysuccinimide esters and hetero-bifunctional reagents which 
usually contain an amino-reactive moiety such as N-hydroxysuccinimide and another group 

20 specific-reactive moiety such as maleimido or dithio moiety (SH) or carbodiimide (COOH). 
In addition, peptides can be conformationally constrained by, for example, incorporation of 
and Nj,-methylamino acids, introduction of double bonds between and atoms of amino 
acids and the formation of cyclic peptides or analogues by introducing covalent bonds such as 
forming an amide bond between the N and C termini, between two side chains or between a 

25 side chain and the N or C terminus. 

These types of modifications may be important to stabilise the proteinase/kinase if administered 
to an individual or for use as a diagnostic reagent. 

30 The present invention further contemplates chemical analogues of the proteinase/kinase capable 
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of acting as antagonists or agonists of the native molecules or which can act as functional 
analogues of the native molecules. For example, an antagonist may be a proteinase inhibitor. 
Chemical analogues may not necessarily be derived from the subject enzymes but may share 
certain conformational similarities. Alternatively, chemical analogues may be specifically 
5 designed to mimic certain physiochemical properties of the serine proteinases or kinases. 
Chemical analogues may be chemically synthesised or may be detected following, for example, 
natural product screening. 

The identification of the novel molecules of the present invention permits the generation of a 
10 range of therapeutic molecules capable of modulating expression of their native counterparts 
or modulating their activity. Modulators contemplated by the present invention includes 
agonists and antagonists of proteinase/kinase expression. Antagonists of proteinase/kinase 
expression include antisense molecules, ribozymes and co-suppression molecules. Agonists 
include molecules which increase promoter ability or interfere with negative regulatory 
15 mechanisms. Agonists of proteinase/kinase include molecules which overcome any negative 
regulatory mechanism. Antagonists of the proteinase/kinase include antibodies and inhibitor 
peptide fragments. 
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TABLE 1 



Non-conventional Code Non-conventional Code 

amino acid amino acid 

5 



a-aminobutyric acid 


Abu 


L-N-methylalanine 


Nmala 


a-amino-a-methylbutyrate 


Mgabu 


L-N-methylarginine 


Nmarg 


aminocyclopropane- 


Cpro 


L-N-methylasparagine 


Nmasn 


carboxylate 




L-N-methylaspartic acid 


Nmasp 


aminoisobutyric acid 


Aib 


L-N-methylcysteine 


Nmcys 


aminonorbomyl- 


Norb 


L-N-methylglutamine 


Nmgln 


carboxylate 




L-N-methylglutamic acid 


Nmglu 


cyciohexylalanine 




Chexa L-N-methylhistidine 


Nmhis 


cyciopentylalatdne 


Cpen 


L-N-methylisoUeucine 


Nmile 


D-alanine 


Dal 


L-N-methylleucine 

✓ 


Nmleu 


D-arginine 


Dare 


L-N-methyllysine 


Nmlys 


D-aspartic acid 


DasD 


L-N-methylmethionine 


Nipmet 


D-cysteine 


Dcys 


L-N-methylnorleucine 


Nmnle 


D-giutamiiie 


Dgln 


L-N-methyinorvaline 


Nmnva 


1 ^ • "J 

D*glutamic acid 


Dglu 


L-N-methylomithine 


Nmom 


D-histidine 


Dhis 


L-N-methylphenylalanine 


Nmphe 


D-isoleucine 


Dile 


L-N-methylproline 


Nmpro 


D-leucine 


Dleu 


L-N-methylserine 


Nmser 


D-lysine 


Dlys 


L-N-methyithreonine 


Nmthr 


D-methionine 


Dmet 


L-N-methyltryptophan 


Nmtrp 


D-omithine 


Dom 


L-N-methyltyrosine 


Nmtyr 


D-phenylaianine 


Dphe 


L-N-methylvaline 


Nmval 


D-proline 


Dpro 


L-N-methylethylglycine 


Nmetg 


D-serine 


Dser 


L-N-methyl-t-butylglycine 


Nmtbug 


D-threonine 


Dthr 


L-norleucine 


Nle 
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D-tryptophan 

D-tyrosine 

D-valine 

D-a-methylalanine 
5 D-a-methylarginine 
D-a-methylasparagine 
D-a-methylaspartate 
D-a-methylcysteine 
D-a-methylglutamine 

10 D-a-methylhistidine 
D-a-methylisoleucine 
D-a-methylleucine 
D-a-methyllysine 
D-a-methylmethionine 

15 D-a-methylomithiQe 
D-a-methylphenylalanine 
D-a-methylproline 
D-a-methylserine 
D-a-methylthreonine 

20 D-a-methyltryptophan 
D-a-methyltyrosine 
D-a-methylvaline 
D-N-methylalanine 
D-N-methylarginine 

25 D-N-methylasparagine 
D-N-methylaspartate 
D-N-methylcysteine 
D-N-methylglutamine 
D-N-methylglutamate 

30 D-N-methylhistidine 
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Dtrp 


L-norvaline 


Nva 


Dtyr 


a-methyl-aminoisobutyrate 


Maib 


Dval 


a-methyl-Y-aminobutyrate 


Mgabu 


Dmala 


a-methylcyclohexylalanine 


Mchexa 


Dmarg 


a-methylcylcopentylalanine 


Mcpen 


Dmasn 


a-methyl-a-napthylalanine 


Manap 


Dmasp 


a-methylpenicillamine 


Mpen 


Dmcys 


N-(4-aininobutyl)glycine 


Nglu 


Dmgln 


N-(2-ammoethyl)glycme 


Naeg 


Dmhis 


N-(3 -aminopropy l)gly cine 


Nora 


Dmile 


N-amino-a-methylbutyrate 


Nmaabu 


Dmleu 


a-napthylalanine 


Anap 


Dmlys 


N-benzylglycine 


Nphe 


Dmmet 


N-(2-carbamylethyl)glycine 


Ngin 


Dmom 


N-(carbamylmethyi)glycine 


Nasn 


Dmphe 


N-(2-carboxyethyl)glycine 


Nglu 


Dmpro 


N-(carboxymetliyl)glycine 


Nasp 


Dmser 


N-cyclobutylglycine 


Ncbut 


Dmthr 


N-cycloheptylglycine 


Nchep 


Dmtrp 


N-cyclohexylglycine 


Nchex 


Dmty 


N-cyclodecylglycine 


Ncdec 


Dmval 


N-cylcododecylglycine 


Ncdod 


Dnmala 


N-cyclooctylglycine 


Ncoct 


Dnmeirg 


N-cyclopropylglycine 


Ncpro 


i^nmasn 


IN -cy CAOunoecy igiy cme 


Ncund 


Dnmasp 


N-(2,2-diphenylethyl)glycine 


Nbhm 


Dnmcys 


N-(3 ,3-dipheiiylpropyl)glycine 


Nbhe 


Dnmgln 


N-(3 -guanidinopropy l)gly cine 


Narg 


Dnmglu 


N-( 1 -hydroxy ethyl)glycme 


Nthr 


Dnmhis 


N-(hydroxyethyl))glycine 


Nser 
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D-N-methylisoIeucine 


Dnmile 


N-(imidazolylethyl))glycine 


Nhis 


D-N-methylleucine 


Dnmleu 


N-(3 -indoly lyethyl)glycine 


Nhtrp 


D-N-methyllysine 


Dnmlys 


N-methyl-v-aminobutyrate 


Nmgabu 


N-methylcyclohexylalanine 


Nmchexa 


D-N-methylmethionine 


Dnmmet 


5 D-N-methylornithine 


Dnmorn 


N-methylcyclopentylalanine 


Nmcpen 


N-methylglycine 


Nala 


D-N-methylphenyialanine 


Dnmphe 


N-methylaminoisobutyrate 


Nmaib 


D-N-methylproline 


Dnmpro 


N-(l -methylpropyl)glycine 


Nile 


D-N-methylserine 


Dnmser 


N-(2-methylpropyl)gIycine 


Nleu 


D-N-methylthreonine 


Dnmthr 


10 D-N-methyltryptophan 


Dnmtrp 


N-( 1 -methylethyl)gly cine 


Nval 


D-N-methyltyrosine 


Dnmtyr 


N-methyla-naptliylalaiiine 


Nmanap 


D-N-methylvaline 


Dnmval 


N-methylpenicillamine 


Nmpen 


Y-aminobutyric acid 


Gabu 


N-(p-hydroxyphenyl)glycine 


Nhtyr 


L-/-butylglycine 


Tbug 


N-(thiomethyl)glycine 


Ncys 


15 L-ethylglycine 


Etc 


penicillamine 
* 


Pen 


L-homophenylalanine 


Hphe 


L-a-methylalanine 


Mala 


L-a-methylarginine 


Marg 


L-a-methylasparagine 


Masn 


L-a-methylaspartate 


Masp 


L-a-methyl-/-butylgIycine 


Mtbug 


L-a-methylcysteine 


Mcys 


L-methylethylglycine 


Metg 


20 L-a-methylglutamine 


Mgln 


L-a-methylglutamate 


Mglu 


L-a-methylhistidine 


Mhis 


L-a-methylhomophenylalanine 


Mhphe 


L-a-methylisoleucine 


Mile 


N-(2-methylthioethyl)glycine 


Nmet 




iviieu 






L-a-methylmethionine 


Mmet 


L«a-methylnorleucine 


Mnle 


25 L-a-methylnorvaline 


Mnva 


L-a-methylomithine 


Mom 


L-a-methylphenylalanine 


Mphe 


L-a-methylproline 


Mpro 


L-a-methylserine 


Mser 


L-a-methylthreonine 


Mthr 


L-a-methyltryptophan 


Mtrp 


L-a-methyltyrosine 


Mtyr 
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L-a-methylvaline Mval 
N-(N-(2,2-diphenylethyl) Nnbhm 
carbainylmethyl)glycine 
1 -carboxy- 1 -(2,2-diphenyl- Nmbc 
5 ethylamino)cyclopropane 



L-N-methylhomophenylalanine Nmhphe 
N-(N-(3 ,3 -dipheny Ipropyl) Nnbhe 
carbamylmethyl)glycine 



Other derivatives contemplated by the present invention include a range of glycosylation 
variants from a completely unglycosylated molecule to a modified glycosylated molecule. 
10 Altered glycosylation pattems may result from expression of recombinant molecules in 
different host cells. 

Another embodiment of the present invention contemplates a method for modulating 
expression of proteinase/kinase in a human, said method comprising contacting the 

15 proteinase/kinase gene encoding proteinase/Tcinase with an effective amount of a modulator 
of proteinase/kinase expression for a time and under conditions sufficient to up-regulate or 
down-regulate or otherwise modulate expression of proteinase/kinase. For example, a nucleic 
acid molecule encoding proteinase/kinase or a derivative thereof may be introduced into a cell 
conversely, proteinase/kinase antisense sequences such as oligonucleotides may be 

20 introduced. 

Another aspect of the present invention contemplates a method of modulating activity of 
proteinase/kinase in a human, said method comprising administering to said manmial a 
modulating effective amount of a molecule for a time and under conditions sufficient to 
25 increase or decrease proteinase/kinase activity. The molecule may be a proteinaceous 
molecule or a chemical entity and may also be a derivative of proteinase/kinase or its receptor 
or a chemical analogue or truncation mutant of proteinase/kinase or its receptor. 

Accordingly, the present invention contemplates a pharmaceutical composition comprising 
30 proteinase/kinase or a derivative thereof or a modulator of proteinase/kinase expression or 
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proteinase/kinase activity and one or more pharmaceutically acceptable carriers and/or 
diluents. These components are referred to as the active ingredients. 

The pharmaceutical forms suitable for injectable use include sterile aqueous solutions (where 
5 water soluble) or dispersions and sterile powders for the extemporaneous preparation of sterile 
injectable solutions or dispersion or may be in the form of a cream or other form suitable for 
topical application. It must be stable under the conditions of manufacture and storage and 
must be preserved against the contaminating action of microorganisms such as bacteria and 
fungi. The carrier can be a solvent or dispersion medium containing, for example, water, 

10 ethanol, polyol (for example, glycerol, propylene glycol and liquid polyethylene glycol, and 
the like), suitable mixtures thereof, and vegetable oils. The proper fluidity can be maintained, 
for example, by the use of a coating such as licithin, by the maintenance of the required 
particle size in the case of dispersion and by the use of superfactants. The preventions of the 
action of microorganisms can be brought about by various antibacterial and antifungal agents, 

15 for example, parabens, chlorobutanol, phenol, sorbic acid, thirmerosal and the like. In many 
cases, it will be preferable to include isotonic agents, for example, sugars or sodium chloride. 
Prolonged absorption of the injectable compositions can be brought about by the use in the 
compositions of agents delaying absorption, for example, aluminum monostearate and gelatin. 

20 Sterile injectable solutions are prepared by incorporating the active compounds in the required 
amount in the appropriate solvent with various of the other ingredients enumerated above, as 
required, followed by filtered sterilization. Generally, dispersions are prepared by 
incorporating the various sterilized active ingredient into a sterile vehicle which contains the 
basic dispersion medium and the required other ingredients from those enumerated above. In 

25 the case of sterile powders for the preparation of sterile injectable solutions, the preferred 
methods of preparation are vacuum drying and the freeze-drying technique which yield a 
powder of the active ingredient plus any additional desired ingredient from previously 
sterile-filtered solution thereof. 

30 When the active ingredients are suitably protected they may be orally administered, for 
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example, with an inert diluent or with an assimilable edible carrier, or it may be enclosed in 
hard or soft shell gelatin capsule, or it may be compressed into tablets, or it may be 
incorporated directly with the food of the diet. For oral therapeutic administration, the active 
compound may be incorporated with excipients and used in the form of ingestible tablets, 
S buccal tablets, troches, capsules, elixirs, suspensions, syrups, wafers, and the like. Such 
compositions and preparations should contain at least 1% by weight of active compound. The 
percentage of the compositions and preparations may, of course, be varied and may 
conveniently be between about 5 to about 80% of the weight of the unit. The amount of active 
compound in such therapeutically useful compositions in such that a suitable dosage will be 
10 obtained. Preferred compositions or preparations according to the present invention are 
prepared so that an oral dosage unit form contains between about 0. 1 ug and 2000 mg of active 
compoimd. 

The tablets, troches, pills, capsules and the like may also contain the components as listed 
IS hereafter. A binder such as gum, acacia, com starch or gelatin; excipients such as dicalcium 
phosphate; a disintegrating agent such as com starch, potato starch, alginic acid and the like; 
a lubricant such as magnesium stearate; and a sweetening agent such a sucrose, lactose or 
saccharin may be added or a flavouring agent such as pepperaiint, oil of wintergreen, or cherry 
flavouring. When the dosage unit form is a capsule, it may contain, in addition to materials 
20 of the above type, a liquid carrier. Various other materials may be present as coatings or to 
otherwise modify the physical form of the dosage unit. For instance, tablets, pills, or capsules 
may be coated with shellac, sugar or both. A syrup or elixir may contain the active compound, 
sucrose as a sweetening agent, methyl and propylparabens as preservatives, a dye and 
flavouring such as cherry or orange flavour. Of course, any material used in preparing any 
25 dosage unit form should be pharmaceutically pure and substantially non-toxic in the amounts 
employed. In addition, the active compound(s) may be incorporated into sustained-release 
preparations and formulations. 
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The present invention also extends to forms suitable for topical application such as creams, 
lotions and gels. 

Pharmaceutically acceptable carriers and/or diluents include any and all solvents, dispersion 
5 media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents 
and the like. The use of such media and agents for pharmaceutical active substances is well 
known in the art. Except insofar as any conventional media or agent is incompatible with the 
active ingredient, use thereof in the therapeutic compositions is contemplated. Supplementary 
active ingredients can also be incorporated into the compositions. 

10 

It is especially advantageous to formulate parenteral compositions in dosage unit form for ease 
of administration and uniformity of dosage. Dosage unit form as used herein refers to 
physically discrete units suited as unitary dosages for the manunalian subjects to be treated; 
each unit containing a predetermined quantity of active material calculated to produce the 

15 desired therapeutic effect in association with the required pharmaceutical carrier. The 
specification for the novel dosage unit forms of the invention are dictated by and directly 
dependent on (a) the unique characteristics of the active material and the particular therapeutic 
effect to be achieved, and (b) the Umitations inherent in the art of compounding such an active 
material for the treatment of disease in living subjects having a diseased condition in which 

20 bodily health is impaired as herein disclosed in detail. 

The principal active ingredient is compounded for convenient and effective administration in 
effective amounts with a suitable pharmaceutically acceptable carrier in dosage unit form as 
hereinbefore disclosed. A unit dosage form can, for example, contain the principal active 
25 compound in amounts ranging from 0.5 [ig to about 2000 mg. Expressed in proportions, the 
active compound is generally present in from about 0.5 ^g to about 2000 mg/ml of carrier. 
In the case of compositions containing supplementary active ingredients, the dosages are 
determined by reference to the usual dose and manner of administration of the said 
ingredients. 
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The pharmaceutical composition may also comprise genetic molecules such as a vector 
capable of transfecting target cells where the vector carries a nucleic acid molecule capable 
of modulating proteinase/kinase expression or proteinase/kinase activity. The vector may, for 
example, be a viral vector. 

5 

Still another aspect of the present invention is directed to antibodies to proteinase/kinase and 
its derivatives. Such antibodies may be monoclonal or polyclonal and may be selected from 
naturally occurring antibodies to proteinase/kinase or may be specifically raised to 
proteinase/kinase or derivatives thereof. In the case of the latter, proteinase/kinase or its 
10 derivatives may first need to be associated with a carrier molecule. The antibodies and/or 
recombinant proteinase/kinase or its derivatives of the present invention are particularly useful 
as therapeutic or diagnostic agents. 

For example, proteinase/kinase and its derivatives can be used to screen for naturally occurring 
15 antibodies to proteinase/kinase. These may occur, for example in some autoimmune diseases. 
Alternatively, specific antibodies can be used to screen for proteinase/kinase. Techniques for 
such assays are well known in the art and include, for example, sandwich assays and ELIS A. 
Knowledge of proteinase/kinase levels may be important for diagnosis of certain cancers or 
a predisposition to cancers or for monitoring certain therapeutic protocols/ 

20 

Antibodies the proteinase/kinase of the present invention may be monoclonal or polyclonal. 
Altematively, fragments of antibodies may be used such as Fab fragments. Furthermore, the 
present invention extends to recombinant and synthetic antibodies and to antibody hybrids. 
A "synthetic antibody" is considered herein to include fragments and hybrids of antibodies. 
25 The antibodies of this aspect of the present invention are particularly useful for 
immunotherapy and may also be used as a diagnostic tool for assessing apoptosis or 
monitoring the program of a therapeutic regimin. 



For example, specific antibodies can be used to screen for proteinase/kinase proteins. The 
30 latter would be important, for example, as a means for screening for levels of proteinase/kinase 
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in a cell extract or other biological fluid or purifying proteinase/kinase made by recombinant 
means from culture supernatant fluid. Techniques for the assays contemplated herein are 
known in the art and include, for example, sandwich assays and ELISA. 

5 It is within the scope of this invention to include any second antibodies (monoclonal, 
polyclonal or fragments of antibodies or synthetic antibodies) directed to the first mentioned 
antibodies discussed above. Both the first and second antibodies may be used in detection 
assays or a first antibody may be used with a commercially available anti-immunoglobulin 
antibody. An antibody as contemplated herein includes any antibody specific to any region 
10 of proteinase/kinase. 

Both polyclonal and monoclonal antibodies are obtainable by immunization with the enzyme 
or protein and either type is utilizable for immunoassays. The methods of obtaining both types 
of sera are well known in the art. Polyclonal sera are less preferred but are relatively easily 
15 prepared by injection of a suitable laboratory animal with an effective eunount of 
proteinase/kinase, or antigenic parts thereof, collecting serum from the animal, and isolating 
specific sera by any of the known immunoadsorbent techniques. Although antibodies 
produced by this method are utilizable in virtually any type of immunoassay, they are 
generally less favoured because of the potential heterogeneity of the product. 

20 

The use of monoclonal antibodies in an immunoassay is particularly preferred because of the 
ability to produce them in large quantities and the homogeneity of the product. The 
preparation of hybridoma cell lines for monoclonal antibody production derived by fusing an 
immortal cell line and lymphocytes sensitized against the inmiunogenic preparation can be 
25 done by techniques which are well known to those who are skilled in the art. 

Another aspect of the present invention contemplates a method for detecting proteinase/kinase 
in a biological sample from a subject said method comprising contacting said biological 
sample with an antibody specific for proteinase/kinase or its derivatives or homologues for a 
30 time and under conditions sufficient for an antibody-proteinase/kinase complex to form, and 



P:V0PER\EIH\NOVELM.PRV- 13/2/97 



-50- 

then detecting said complex. 

The presence of proteinase/kinase may be accomplished in a number of ways such as by 
Western blotting and ELISA procedures, A wide range of immunoassay techniques are 
5 available as can be seen by reference to US Patent Nos. 4,016,043, 4, 424,279 and 4,01 8,653. 
These, of course, includes both single-site and two-site or "sandwich" assays of the non- 
competitive types, as well as in the traditional competitive binding assays. These assays also 
include direct binding of a labelled antibody to a target. 

10 Sandwich assays are among the most useful and conunonly used assays and are favoured for 
use in the present invention. A number of variations of the sandwich assay technique exist, 
and all are intended to be encompassed by the present invention. Briefly, in a typical forward 
assay, an unlabelled antibody is inmiobilized on a solid substrate and the sample to be tested 
brought into contact with the bound molecule. After a suitable period of incubation, for a 

15 period of time sufficient to allow formation of an antibody-antigen complex, a second 
antibody specific to the antigen, labelled with a reporter molecule capable of producing a 
detectable signal is then added and incubated, allowing time sufficient for the formation of 
another complex of antibody-antigen-labelled antibody. Any unreacted material is washed 
away, and the presence of the antigen is determined by observation of a signal produced by 

20 the reporter molecule. The results may either be qualitative, by simple observation of the 
visible signal, or may be quantitated by comparing with a control sample containing known 
amounts of hapten. Variations on the forward assay include a simultaneous assay, in which 
both sample and labelled antibody are added simultaneously to the bound antibody. These 
techniques are well known to those skilled in the art, including any minor variations as will 

25 be readily apparent In accordance with the present invention the sample is one which might 
contain proteinase/kinase including cell extract, tissue biopsy or possibly serum, saliva, 
mucosal secretions, lymph, tissue fluid and respiratory fluid. The sample is, therefore, 
generally a biological sample comprising biological fluid but also extends to fermentation 
fluid and supernatant fluid such as from a cell culture. 

30 
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In the typical forward sandwich assay, a first antibody having specificity for the 
proteinase/kinase or antigenic parts thereof, is either covalently or passively bound to a solid 
surface. The solid surface is typically glass or a polymer, the most commonly used polymers 
being cellulose, polyacrylamide, nylon, polystyrene, polyvinyl chloride or polypropylene. The 
5 solid supports may be in the form of tubes, beads, discs of microplates, or any other surface 
suitable for conducting an immunoassay. The binding processes are well-known in the art and 
generally consist of cross-linking covalently binding or physically adsorbing, the polymer- 
antibody complex is washed in preparation for the test sample. An aliquot of the sample to 
be tested is then added to the solid phase complex and incubated for a period of time sufficient 
10 (e.g. 2-40 minutes) and under suitable conditions (e.g. 25 °C) to allow binding of any subunit 
present in the antibody. Following the incubation period, the antibody subunit solid phase is 
washed and dried and incubated with a second antibody specific for a portion of the hapten. 
The second antibody is linked to a reporter molecule which is used to indicate the binding of 
the second antibody to the hapten. 

15 

An alternative method involves immobilizing the target molecules in the biological sample and 
then exposing the immobilized target to specific antibody which may or may not be labelled 
with a reporter molecule. Depending on the amount of target and the strength of the reporter 
molecule signal, a bound target may be detectable by direct labelling with the antibody. 
20 Alternatively, a second labelled antibody, specific to the first antibody is exposed to the target- 
first antibody complex to form a target-first antibody-second antibody tertiary complex. The 
complex is detected by the signal emitted by the reporter molecule. 

By "reporter molecule" as used in the present specification, is meant a molecule which, by its 
25 chemical nature, provides an analytically identifiable signal which allows the detection of 
antigen-bound antibody. Detection may be either qualitative or quantitative. The most 
commonly used reporter molecules in this type of assay are either enzymes, fluorophores or 
radionuclide containing molecules (i.e. radioisotopes) and chemiluminescent molecules. 
In the case of an enzyme immunoassay, an enzyme is conjugated to the second antibody, 
30 generally by means of glutaraldehyde or periodate. As will be readily recognized, however. 
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a wide variety of different conjugation techniques exist, which are readily available to the 
skilled artisan. Commonly used enzymes include horseradish peroxidase, glucose oxidase, 
beta-galactosidase and alkaline phosphatase, amongst others. The substrates to be used with 
the specific enzymes are generally chosen for the production, upon hydrolysis by the 
5 corresponding enzyme, of a detectable colour change. Examples of suitable enzymes include 
alkaline phosphatase and peroxidase. It is also possible to employ fluorogenic substrates, 
which yield a fluorescent product rather than the chromogenic substrates noted above. In all 
cases, the enzyme-labelled antibody is added to the first antibody hapten complex, allowed to 
bind, and then the excess reagent is washed away, A solution containing the appropriate 
10 substrate is then added to the complex of antibody-antigen-antibody. The substrate will react 
with the enzyme Unked to the second antibody, giving a qualitative visual signal, which may 
be further quantitated, usually spectrophotometrically, to give an indication of the amount of 
hapten which was present in the sample. "Reporter molecule" also extends to use of cell 
agglutination or inhibition of agglutination such as red blood cells on latex beads, and the like. 

15 

Alternately, fluorescent compounds, such as fluorescein and rhodamine, may be chemically 
coupled to antibodies without altering their binding capacity. When activated by illumination 
with light of a particular wavelength, the fluorochrome-labelled antibody adsorbs the light 
energy, inducing a state to excitability in the molecule, followed by emission of the light at 

20 a characteristic colour visually detectable with a light microscope. As in the EIA, the 
fluorescent labelled antibody is allowed to bind to the first antibody-hapten complex. After 
washing off the unbound reagent, the remaining tertiary complex is then exposed to the light 
of the appropriate wavelength the fluorescence observed indicates the presence of the hapten 
of interest. Immunofluorescene and EIA techniques are both very well established in the art 

25 and are particularly preferred for the present method. However, other reporter molecules, such 
as radioisotope, chemiluminescent or bioluminescent molecules, may also be employed. 

The present invention also contemplates genetic assays such as involving PGR analysis to 
detect proteinase/kinase gene or its derivatives. Alternative methods or methods used in 
30 conjunction include direct nucleotide sequencing or mutation scanning such as single stranded 
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conformation polymorphoms analysis (SSCP), specific oligonucleotide hybridisation, and 
methods such as direct protein truncation tests. 

The nucleic acid molecules of the present invention may be DNA or RNA. When the nucleic 
5 acid molecule is in DNA form, it may be genomic DNA or cDNA. RNA forms of the nucleic 
acid molecules of the present invention are generally mRNA. 

Although the nucleic acid molecules of the present invention are generally in isolated form, 
they may be integrated into or ligated to or otherwise fused or associated with other genetic 
10 molecules such as vector molecules and in particular expression vector molecules. Vectors 
and expression vectors are generally capable of replication and, if applicable, expression in 
one or both of a prokaryotic cell or a eukaryotic cell. Preferably, prokaryotic cells include E. 
coli, Bacillus sp and Pseudomonas sp. Preferred eukaryotic cells include yeast, fungal, 
manmialian and insect cells. 

15 

Accordingly, another aspect of the present invention contemplates a genetic construct 
comprising a vector portion and a mammalian and more particularly a human 
proteinase/kinase gene portion, which proteinase/kinase gene portion is capable of encoding 
an proteinase/kinase polypeptide or a functional or immunologically interactive derivative 
20 thereof 

Preferably, the proteinase/kinase gene portion of the genetic construct is operably linked to 
a promoter on the vector such that said promoter is capable of directing expression of said 
proteinase/kinase gene portion in an appropriate cell. 

25 

In addition, the proteinase/kinase gene portion of the genetic construct may comprise all or 
part of the gene fused to another genetic sequence such as a nucleotide sequence encoding 
glutathione-S-transferase or part thereof. 

30 The present invention extends to such genetic constructs and to prokaryotic or eukaryotic cells 
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comprising same. 

The present invention also extends to any or all derivatives of proteinase/kinase including 
mutants, part, fragments, portions, homologues and analogues or their encoding genetic 
5 sequence including single or multiple nucleotide or amino acid substitutions, additions and/or 
deletions to the naturally occurring nucleotide or amino acid sequence. The present invention 
further encompasses hybrids between the proteinase/kinases such as to broaden the spectrum 
of activity and to ligands and substrates of the proteinase/kinase. 

10 The proteinase/kinase and its genetic sequence of the present invention will be useful in the 
generation of a range of therapeutic and diagnostic reagents. 

Soluble proteinase/kinase polypeptides or other derivatives, agonists or antagonists are also 
contemplated to be useful in the treatment of disease, injury or abnormality in the nervous 

15 system, e.g. in relation to central or peripheral nervous system to treat Cerebral Palsy, trauma 
induced paralysis, vascular ischaemia associated with stroke, neuronal tumours, motoneurone 
disease, Parkinson's disease, Huntington's disease, Alzheimer's disease. Multiple Sclerosis, 
peripheral neuropathies associated with diabetes, heavy metal or alcohol toxicity, renal failure 
and infectious diseases such as herpes, mbella, measles, chicken pox, HIV or HTLV-1 . Other 

20 conditions for which the proteinase/kinase are useful include cancer, metastasis and 
autoimmune disease amongst many others. 

A further aspect of the present invention contemplates the use of proteinase/kinase or its 
functional derivatives in the manufacture of a medicament for the treatment of 
25 proteinase/kinase mediated conditions defective or deficient. 

The present invention is further described by the following non-limiting Figures and 
Examples. 

30 In the Figures: 
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Figure 1 is a representation showing (A) schematic and (B) hydrophobicity plot of the 
HELA2 amino acid sequence. 

Figure 2 is a photograph representation showing Northern blot analysis of HELA2. 

5 

Total RNA was isolated from HeLa cells and HeLa cells stably transfected with a PA 1-2 
expressing construct (Sla cells). Sla cells had been treated with TNF and cycloheximide 
for the times shown. Ten fxg of total RNA was eletrophoresed on a denaturing agarose gel, 
transferred to Hybond N and fixed by UV irradiation. DNA for probing was recovered by 
10 EcoRl digestion of the pGEM-T cloned RT-PCR product (450 bp). Radiolabelling was 
performed by the random priming method. Blots were washed to a stringency of 0.2 x SSC 
and 0.1% w/v SDS. 

Figm"e 3 is a photographic representation showing Northern analysis of BCON3. The 
15 method is as described in the legend for Figure 2. 
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EXAMPLE 1 
CLONING PROCEDURES 

In order to identify serine proteinases that may be involved in regulatory cellular functions, 
5 a genetic screening approach was applied using degenerate primers corresponding to 
conserved regions of serine proteinases (amino acids flanking His- and Ser- residues) to 
amplify gene fragments spanning these regions from cDNA, using a low stringency RT-PCR 
(Reverse Transcriptase-Polymerase Chain Reaction) approach. 

10 By this technique, the aim was to isolate low abundance genes as well as those present in 
moderate to high abundance. The cDNA used for these experiments was isolated from a 
HeLa cell cytotoxicity model wherein PAI-2 expression inhibits TNFa-induced apoptosis 
(Dickinson, etalJ. Biol. Chem. 270, 27894-27904, 1995). These PAI-2 expressmg cells 
provide a unique and viable system for investigating TNF signalling pathways as they are 

15 protected from the cytotoxic effects of TNFa. 

cDNA was generated from RNA isolated from PAI-2 expressing HeLa cells, untreated and 
following treatment with TNF and cycloheximide. Amplification of both cDNA populations 
using PCR and the following serine proteinase degenerate primers, 

20 

His Primer: 5'ACAGAATTCTGGGTIGTIACIGCIGCICAYTG3' [SEQ ID NO:l], 
Ser Pruner: 5'ACACTTAAGAXIGGICCICCIC/GT/AXTCICC3' [SEQ ID NO:2] 

produced DNA fragments in the range of 480bp, the approximate predicted size of the serine 
25 proteinase intergenic region. These amplified DNA fragments were cloned into E. coli 
generating a library containing ^^proximately 150 independent clones. We have analysed 36 
of these clones were analysed and found that 9 encode the previously identified serine 
proteinases, tissue-type or urokinase-type plasminogen activators, thereby demonstrating the 
efficacy of this approach. Of the other 36, two were found to encode novel open reading 
30 frames with high homology to serine proteinases and are referred to herein as -HELA2*' and 
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"ATC2", One additional clone, "BCON3", designated herein showed homology to the 
family of kinases. Extension of the DNA fragments by RACE in both 5' and 3' directions 
using internally derived primers has verified their homology to the serine proteinase family. 
The DNA sequences are unique in that they are markedly different from any known DNA 
5 or protein sequence in the Genbank and NBRF databases. 

EXAMPLE 2 
HELA2 SERINE PROTEINASE 

10 The HELA2 mRNA transcript is approximately 1.5kb as approximated from Northern blots. 
Nucleic acid sequence was obtained for 1.1 kb of HELA2 which spans the entire coding 
region, the 3' noncoding region and part of the 5' noncoding region. The coding region 
starts with an ATG codon which is present in a motif analogous to the Kozak eukaryotic 
translation initiation consensm sequence. Alignment of the deduced amino acid sequence of 

15 HELA2 with homologous serine proteinases shows that the cDNA encodes a 314aa 
polypeptide with a calculated molecular weight of 34.8kD, which is synthesized as a 
zymogen containing pre-, pro- and catalytic regions. The pro- region (or light chain) and the 
catalytic region (heavy chain) are delineated by a classic serine proteinase activation motif 
Arg-Ile-Val-Gly-Gly with cleavage likely occurring between Arg and He. The catalytic 

20 region includes the catalytic triad of His, Asp and Ser in positions and motifs which are 
highly conserved among the serine proteinases. Ten Cys residues occur in conserved 
positions: by analogy to other serine proteinases, eight of these function to form disulfide 
bridges within the catalytic region and the remaining two link the pro- and catalytic regions. 

25 Structural features conserved in the binding pockets of serine proteinases are present in 
HELA2. An Asp residue at the bottom of the serine proteinase binding pocket six residues 
before the active site Ser in HELA2 indicates that HELA2 has trypsin-like specificity, with 
proteolytic cleavage after Arg or Lys in target substrates. HELA2 also contains a conserved 
Ser-Trp-Gly motif at the top of the binding pocket which is involved in hydrogen bonding 

30 with target substrates in other serine proteinases. Two isoforms of HELA2 were identified 
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in a HeLa cell cDNA library (Stratagene UniZap HeLa Library) which differ by an insertion 
of 2aa (Tyr-Ser) within the catalytic binding pocket. At the DNA level there is a 
corresponding insertion of 6 nucleotides which generates a Sfil restriction enzyme site. 

5 A hydrophobicity plot of the HELA2 amino acid sequence (Figure 1) identifies two 
hydrophobic regions, one located at the amino terminus and the other at the carboxy 
terminus. The 20aa amino terminal hydrophobic region is likely to be a signal peptide, 
which could direct newly synthesized HELA2 to enter the endoplasmic reticulum. The 16aa 
hydrophobic carboxy terminus of HELA2 aligns with the transmembrane domain of 
10 prostasin, suggesting that HELA2 is likely to be a membrane-anchored serine proteinase. In 
prostasin, this protruding carboxy terminus may be cleaved, thus releasing the serine 
proteinase from the membrane. A unique feature of HELA2 is the presence of a putative 
mitochondrial localisation sequence directly after the signal peptide sequence. 

15 Preliminary Northern blot data show (Figure 2) that HELA2 mRNA is expressed in the 
epithelial-like HeLa cell, but is not present in the monocyte-like U937 and MonoMac6 cells, 
suggesting that HELA2 expression demonstrates a degree of cell-type specificity and 
therefore is likely to have a specific physiological function. 

20 The HELA2 cDNA was cloned in two isoforms, a short isoform and a long isoform. The 
nucleotide and corresponding amino acid sequence for the short isoform of HELA2 is shown 
in SEQ ID NOs. 3 and 4, respectively. The long isoform is shown in SEQ ID NO:5 and 
6, respectively. 

25 EXAMPLE 2 

ATC2 SERINE PROTEINASE 

ATC2 was isolated from the cDNA of PAI-2 expressing HeLa cells following treatment with 
TNF and cycloheximide. A partial DNA sequence for ATC2 cDNA has been obtained which 
30 encompasses the sequence encoding the serine proteinase catalytic region. Additional clones 
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extending to both 5' and 3' directions have been obtained. The available nucleic acid 
sequence of ATC2 cDNA and its deduced amino acid sequence shows that it is a member of 
the serine proteinase family with homology to hepsin, prostasin, and acrosin. It thus may 
belong to the same family as HELA2, The catalytic region includes the His, Asp and Ser 
5 conserved motifs. Preliminary Northern blot experiments have failed to detect ATC2 inRNA 
in total RNA isolated from resting HeLa cells, suggesting that it may not be expressed in 
abundance in these cells, which may mean that it is tightly regulated. As it was isolated from 
cells following treatment with TNF and cycloheximide, its expression may be induced by 
these agents in HeLa cells. These data have potential significance for a role for ATC2 in 
10 apoptosis and cell death. ATC2 may be intracellular, extracellular or found on the cell surface 
and is likely to be involved in regulating cell functions. Thus ATC2 may have potential 
significance in the treatment of cancer and diseases involving dysregulation of cell growth and 
survival. The nucleotide and corresponding amino acid sequence of ATC2 is shown in SEQ 
ED NOs: 7 and 8, respectively. 

15 

EXAMPLE 4 
BCON3 

The deduced amino acid sequence of BCON3 (SEQ ID NO: 10) reveals that it is novel. 

20 At both the DNA and protein level, BC0N3 shows homology to members of the kinase 
family of proteins. Although it cannot be classified as a member of any particular sub- 
family of kinases, alignments of the BCON3 protein with the conserved domains of 
thymidine kinases and tyrosine and serine/threonine protein kinases indicates possible 
ATP/GTP binding and phosphate transfer regions. Thus, it may be the first member of 

25 a new family of kinases. Analysis of the translation product using hydrophobicity plots 
and the Prosite protein analysis algorithms indicates BCON3 may lack an N-terminal 
signal sequence (that is, it is likely to encode an intracellular protein) and it possess a 
nuclear localization signal. BCON3 mRNA is approximately 2300 nucleotides in length. 
cDNA sequence (SEQ ID N0:9) has been obtained covering about 95% of the 

30 transcript and including the 3' polyA tail. BCON3 mRNA is expressed in normal HeLa 
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cells, as well as in HeLa cell clones that express PAI-2 (Figure 3). 

Those skilled in the art will appreciate that the invention described herein is susceptible 
to variations and modifications other than those specifically described. It is to be 
understood that the invention includes all such variations and modifications. The 
invention also includes all of the steps, features, compositions and compounds referred 
to or indicated in this specification, individually or collectively, and any and all 
combinations of any two or more of said steps or features. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: THE COUNCIL OF THE QUEENSLAND INSTITUTE OF 
MEDICAL RESEARCH 

(ii) TITLE OF INVENTION: NOVEL MOLECULES 

(iii) NUMBER OF SEQUENCES: 10 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: DAVIES COLLISON CAVE 

(B) STREET: 1 LITTLE COLLINS STREET 

(C) CITY: MELBOURNE 

(D) STATE: VICTORIA 

(E) COUNTRY: AUSTRALIA 

(F) ZIP: 3000 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: AUSTRALIAN PROVISIONAL 

(B) FILING DATE: 13-FEB-1997 

(C) CLASSIFICATION: 
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(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: HUGHES, DR E JOHN L 
(C) REFERENCE/DOCKET NUMBER: EJH/AF 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: +61 3 9254 2777 

(B) TELEFAX: +61 3 9254 2770 

(C) TELEX: AA 31787 
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(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l 
ACAGAATTCT GGGTIGTIAC IGCIGCICAY TG 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1094 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 
ACACTTAAGA XIGGICCICC IC/GT/AXTCICC 
(2) INFOPJ^TION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1094 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 17.. 965 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CGCGGGAGAG GAGGCC ATG GGC GCG CGC GGG GCG CTG CTG CTG GCG CTG 49 
Met Gly Ala Arg Gly Ala Leu Leu Leu Ala Leu 
15 10 

CTG CTG GCT CGG GCT GGA CTC AGG AAG CCG GAG TCG CAG GAG GCG GCG 97 
Leu Leu Ala Arg Ala Gly Leu Arg Lys Pro Glu Ser Gin Glu Ala Ala 
15 20 25 

CCG TTA TCA GGA CCA TGC GGC CGA CGG GTC ATC ACG TCG CGC ATC GTG 145 
Pro Leu Ser Gly Pro Cys Gly Arg Arg Val lie Thr Ser Arg lie Val 
30 35 40 

GGT GGA GAG GAC GCC GAA CTC GGG CGT TGG CCG TGG CAG GGG AGC CTG 193 
Gly Gly Glu Asp Ala Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser Leu 
45 50 55 

CGC CTG TGG GAT TCC CAC GTA TGC GGA GTG AGC CTG CTC AGC CAC CGC 241 
Arg Leu Trp Asp Ser His Val Cys Gly Val Ser Leu Leu Ser His Arg 
60 65 70 , 75 

TGG GCA CTC ACG GCG GCG CAC TGC TTT GAA ACT GAC CTT AGT GAT CCC 28 9 

Trp Ala Leu Thr Ala Ala His Cys Phe Glu Thr Asp Leu Ser Asp Pro 
80 85 90 



TCC GGG TGG ATG GTC CAG TTT GGC CAG CTG ACT TCC ATG CCA TCC TTC 
Ser Gly Trp Met Val Gin Phe Gly Gin Leu Thr Ser Met Pro Ser Phe 
95 100 105 



337 
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TGG AGC CTG CAG GCC TAG TAG ACC CGT TAG TTG GTA TGG AAT ATG TAT 385 
Trp Ser Leu Gin Ala Tyr Tyr Thr Arg Tyr Phe Val Ser Asn lie Tyr 
110 115 120 

CTG AGC CCT CGC TAG CTG GGG AAT TCA CCC TAT GAC ATT GCC TTG GTG 433 
Leu Ser Pro Arg Tyr Leu Gly Asn Ser Pro Tyr Asp lie Ala Leu Val 
125 130 135 

AAG CTG TCT GCA CCT GTC ACC TAG ACT AAA CAC ATC CAG CCC ATC TGT 481 
Lys Leu Ser Ala Pro Val Thr Tyr Thr Lys His lie Gin Pro lie Cys 
140 145 150 155 

CTC CAG GCC TCC ACA TTT GAG TTT GAG ?^C CGG AC A GAC TGC TGG GTG 52 9 

Leu Gin Ala Ser Thr Phe Glu Phe Glu Asn Arg Thr Asp Cys Trp Val 
160 165 170 

ACT GGC TGG GGG TAG ATC AAA GAG GAT GAG GCA CTG CCA TCT CCC CAC 577 
Thr Gly Trp Gly Tyr lie Lys Glu Asp Glu Ala Leu Pro Ser Pro His 
175 180 185 

ACC CTC CAG GAA GTT CAG GTC GCC ATC ATA AAC AAC TCT ATG TGC AAC 625 
Thr Leu Gin Glu Val Gin Val Ala lie lie Asn Asn Ser Met Cys Asn 
190 195 200 

CAC CTC TTG CTC AAG TAG AGT TTG CGC AAG GAC ATC TTT GGA GAC ATG 673 
His Leu Phe Leu Lys Tyr Ser Phe Arg Lys Asp lie Phe Gly Asp Met 
205 210 215 

GTT TGT GCT GGC AAT GCC CAA GGC GGG AAG GAT GCC TGC TTC GGT GAC 721 
Val Cys Ala Gly Asn Ala Gin Gly Gly Lys Asp Ala Cys Phe Gly Asp 
220 225 230 235 

TCA GGT GGA CCC TTG GCC TGT AAC AAG GAT GGA CTG TGG TAT CAG ATT 76 9 

Ser Gly Gly Pro Leu Ala Cys Asn Lys Asp Gly Leu Trp Tyr Gin lie 
240 245 250 

GGA GTC GTG AGC TGG GGA GTG GGC TGT GGT CGG CCC AAT CGG CCC GGT 817 
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Gly Val Val Ser Trp Gly Val Gly Cys Gly Arg Pro Asn Arg Pro Gly 
255 260 265 

GTC TAG AGO AAT ATC AGC CAC GAG TTT GAG TGG ATC GAG AAG CTG ATG 86 5 

Val Tyr Thr Asn lie Ser His His Phe Glu Trp lie Gin Lys Leu Met 
270 275 280 

GCC CAG AGT GGC ATG TCC GAG CCA GAG CCC TCC TGG CCG CTA CTC TTT 913 
Ala Gin Ser Gly Met Ser Gin Pro Asp Pro Ser Trp Pro Leu Leu Phe 
285 290 295 

TTC CCT CTT CTC TGG GCT CTC CCA CTC CTG GGG CCG GTC TGA 961 
Phe Pro Leu Leu Trp Ala Leu Pro Leu x^eu Gly Pro Val * 
300 305 310 

GCCTACCTGA GCCCATGCAG CCTGGGGCCA CTGCCAAGTC AGGCCCTGGT TCTCTTCTGT 1015 

CTTGTTTGGT AATAAACACA TTCCAGTTGA TGCCTTGCAG GGCATTTTTC AAAAAAAAAA 1075 

AAAAAAAAAA AAAT^AAAAA 1094 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Gly Ala Arg Gly Ala Leu Leu Leu Ala Leu Leu Leu Ala Arg Ala 
15 10 15 

Gly Leu Arg Lys Pro Glu Ser Gin Glu Ala Ala Pro Leu Ser Gly Pro 
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20 25 30 

Cys Gly Arg Arg Val lie Thr Ser Arg lie Val Gly Gly Glu Asp Ala 
35 40 45 

Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser Leu Arg Leu Trp Asp Ser 
50 55 60 

His Val Cys Gly Val Ser Leu Leu Ser His Arg Trp Ala Leu Thr Ala 
65 70 75 80 

Ala His Cys Phe Glu Thr Asp Leu Ser Asp Pro Ser Gly Trp Met Val 
85 90 95 

Gin Phe Gly Gin Leu Thr Ser Met Pro Ser Phe Trp Ser Leu Gin Ala 
100 105 110 

Tyr Tyr Thr Arg Tyr Phe Val Ser Asn lie Tyr Leu Ser Pro Arg Tyr 
115 120 125 

Leu Gly Asn Ser Pro Tyr Asp lie Ala Leu Val Lys Leu Ser Ala Pro 
130 135 140 

Val Thr Tyr Thr Lys His lie Gin Pro lie Cys Leu Gin Ala Ser Thr 
145 150 155 160 

Phe Glu Phe Glu Asn Arg Thr Asp Cys Trp Val Thr Gly Trp Gly Tyr 
165 170 175 

lie Lys Glu Asp Glu Ala Leu Pro Ser Pro His Thr Leu Gin Glu Val 
180 185 190 

Gin Val Ala lie lie Asn Asn Ser Met Cys Asn His Leu Phe Leu Lys 
195 200 205 



Tyr Ser Phe Arg Lys Asp lie Phe Gly Asp Met Val Cys Ala Gly Asn 
210 215 220 
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Ala Gin Gly Gly 
225 

Ala Cys Asn Lys 

Gly Val Gly Cys 
260 

Ser His His Phe 
275 

Ser Gin Pro Asp 
290 

Ala Leu Pro Leu 
305 



Lys Asp Ala Cys 
230 

Asp Gly Leu Trp 
245 

Gly Arg Pro Asn 

Glu Trp lie Gin 
280 

Pro Ser Trp Pro 
295 

Leu Gly Pro Val 
310 
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Phe Gly Asp Ser 
235 

Tyr Gin lie Gly 
250 

Arg Pro Gly Val 
265 

Lys Leu Met Ala 

Leu Leu Phe Phe 
300 

* 



Gly Gly Pro Leu 
240 

Val Val Ser Trp 
255 

Tyr Thr Asn lie 
270 

Gin Ser Gly Met 
285 

Pro Leu Leu Trp 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 17.. 961 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
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CGCGGGAGAG GAGGCC ATG GGC GCG CGC GGG GCG CTG CTG CTG GCG CTG 

Met Gly Ala Arg Gly Ala Leu Leu Leu Ala Leu 
15 10 

CTG CTG GCT CGG GCT GGA CTC AGG AAG CCG GAG TCG CAG GAG GCG GCG 
Leu Leu Ala Arg Ala Gly Leu Arg Lys Pro Glu Ser Gin Glu Ala Ala 
15 20 25 

CCG TTA TCA GGA CCA TGC GGC CGA CGG GTC ATC ACG TCG CGC ATC GTG 
Pro Leu Ser Gly Pro Cys Gly Arg Arg Val lie Thr Ser Arg lie Val 
30 35 40 

GGT GGA GAG GAC GCC GAA CTC GGG CGT TGG CCG TGG CAG GGG AGC CTG 
Gly Gly Glu Asp Ala Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser Leu 
45 50 55 

CGC CTG TGG GAT TCC CAC GTA TGC GGA GTG AGC CTG CTC AGC CAC CGC 
Arg Leu Trp Asp Ser His Val Cys Gly Val Ser Leu Leu Ser His Arg 
60 65 70 75 

TGG GCA CTC ACG GCG GCG CAC TGC TTT GAA ACC TAT AGT GAC CTT AGT 
Trp Ala Leu Thr Ala Ala His Cys Phe Glu Thr Tyr Ser Asp Leu Ser 
80 85 90 

GAT CCC TCC GGG TGG ATG GTC CAG TTT GGC CAG CTG ACT TCC ATG CCA 
Asp Pro Ser Gly Trp Met Val Gin Phe Gly Gin Leu Thr Ser Met Pro 
95 100 105 

TCC TTC TGG AGC CTG CAG GCC TAC TAC ACC CGT TAC TTC GTA TCG AAT 
Ser Phe Trp Ser Leu Gin Ala Tyr Tyr Thr Arg Tyr Phe Val Ser Asn 
110 115 120 

ATC TAT CTG AGC CCT CGC TAC CTG GGG AAT TCA CCC TAT GAC ATT GCC 
lie Tyr Leu Ser Pro Arg Tyr Leu Gly Asn Ser Pro Tyr Asp lie Ala 
125 130 135 



TTG GTG AAG CTG TCT GCA CCT GTC ACC TAC ACT AAA CAC ATC CAG CCC 
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Leu Val Lys Leu Ser Ala Pro Val Thr Tyr Thr Lys His lie Gin Pro 
140 145 150 155 

ATC TGT CTC CAG GCC TCC ACA TTT GAG TTT GAG AAC CGG ACA GAC TGC 52 9 

lie Cys Leu Gin Ala Ser Thr Phe Glu Phe Glu Asn Arg Thr Asp Cys 
160 165 170 

TGG GTG ACT GGC TGG GGG TAG ATC AAA GAG GAT GAG GCA CTG CCA TCT 577 
Trp Val Thr Gly Trp Gly Tyr lie Lys Glu Asp Glu Ala Leu Pro Ser 
175 180 185 

CCC CAC ACC CTC CAG GAA GTT CAG GTC GCC ATC ATA AAC AAC TCT ATG 625 
Pro His Thr Leu Gin Glu Val Gin Val Ala lie lie Asn Asn Ser Met 
190 195 200 

TGC AAC CAC CTC TTC CTC AAG TAC AGT TTC CGC AAG GAC ATC TTT GGA 673 
Cys Asn His Leu Phe Leu Lys Tyr Ser Phe Arg Lys Asp lie Phe Gly 
205 210 215 

GAC ATG GTT TGT GCT GGC AAT GCC CAA GGC GGG AAG GAT GCC TGC TTC 721 
Asp Met Val Cys Ala Gly Asn Ala Gin Gly Gly Lys Asp Ala Cys Phe 
220 225 230 235 

GGT GAC TCA GGT GGA CCC TTG GCC TGT AAC AAG GAT GGA CTG TGG TAT 76 9 

Gly Asp Ser Gly Gly Pro Leu Ala Cys Asn Lys Asp Gly Leu Trp Tyr 
240 245 250 

CAG ATT GGA GTC GTG AGC TGG GGA GTG GGC TGT GGT CGG CCC AAT CGG 817 
Gin lie Gly Val Val Ser Trp Gly Val Gly Cys Gly Arg Pro Asn Arg 
255 260 265 

CCC GGT GTC TAC ACC AAT ATC AGC CAC CAC TTT GAG TGG ATC CAG AAG 8 65 

Pro Gly Val Tyr Thr Asn lie Ser His His Phe Glu Trp He Gin Lys 
270 275 280 



CTG ATG GCC CAG AGT GGC ATG TCC CAG CCA GAC CCC TCC TGG CCG CTA 
Leu Met Ala Gin Ser Gly Met Ser Gin Pro Asp Pro Ser Trp Pro Leu 



913 
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285 290 295 

CTC TTT TTC CCT CTT CTC TGG GCT CTC CCA CTC CTG GGG CCG GTC TGAGCCTACC 
968 

Leu Phe Phe Pro Leu Leu Trp Ala Leu Pro Leu Leu Gly Pro Val 
300 305 310 315 

TGAGCCCATG CAGCCTGGGG CCACTGCCAA GTCAGGCCCT GGTTCTCTTC TGTCTTGTTT 1028 

GGTAATAAAC ACATTCCAGT TGATGCCTTG CAGGGCATTT TTCAAAAAAA AAAAAAAAAA 1088 

AAAAAAAAAA AA 1100 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Gly Ala Arg Gly Ala Leu Leu Leu Ala Leu Leu Leu Ala Arg Ala 
15 10 15 

Gly Leu Arg Lys Pro Glu Ser Gin Glu Ala Ala Pro Leu Ser Gly Pro 
20 25 30 

Cys Gly Arg Arg Val lie Thr Ser Arg lie Val Gly Gly Glu Asp Ala 
35 40 45 

Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser Leu Arg Leu Trp Asp Ser 
50 55 60 
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His Val Cys Gly 
65 

Ala His Cys Phe 

Met Val Gin Phe 
100 

Gin Ala Tyr Tyr 
115 

Arg Tyr Leu Gly 
130 

Ala Pro Val Thr 
145 

Ser Thr Phe Glu 



Gly Tyr lie Lys 
180 

Glu Val Gin Val 
195 

Leu Lys Tyr Ser 
210 

Gly Asn Ala Gin 
225 

Pro Leu Ala Cys 



Val Ser Leu Leu 
70 

Glu Thr Tyr Ser 
85 

Gly Gin Leu Thr 

Thr Arg Tyr Phe 
120 

Asn Ser Pro Tyr 
135 

Tyr Thr Lys His 
150 

Phe Glu Asn Arg 
165 

Glu Asp Glu Ala 

Ala lie lie Asn 
200 

Phe Arg Lys Asp 
215 

Gly Gly Lys Asp 
230 

Asn Lys Asp Gly 
245 
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Ser His Arg Trp 
75 

Asp Leu Ser Asp 
90 

Ser Met Pro ^Ser 
105 

Val Ser Asn lie 

Asp lie Ala Leu 
140 

lie Gin Pro lie 
155 

Thr Asp Cys Trp 
170 

Leu Pro Ser Pro 
185 

Asn Ser Met Cys 

lie Phe Gly Asp 
220 

Ala Cys Phe Gly 
235 

Leu Trp Tyr Gin 
250 



Ala Leu Thr Ala 
80 

Pro Ser Gly Trp 
95 

Phe Trp Ser Leu 
110 

Tyr Leu Ser Pro 
125 

Val Lys Leu Ser 

Cys Leu Gin Ala 
160 

Val Thr Gly Trp 
175 

His Thr Leu Gin 
190 

Asn His Leu Phe 
205 

Met Val Cys Ala 

Asp Ser Gly Gly 
240 

He Gly Val Val 
255 



Ser Trp Gly Val Gly Cys Gly Arg Pro Asn Arg Pro Gly Val Tyr Thr 
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260 



265 



270 



Asn lie Ser His His Phe Glu Trp lie Gin Lys Leu Met Ala Gin Ser 



275 



280 



285 



Gly Met Ser Gin Pro Asp Pro Ser Trp Pro Leu Leu Phe Phe Pro Leu 



290 



295 



300 



Leu Trp Ala Leu Pro Leu Leu Gly Pro Val 



305 



310 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 24.. 799 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
AGTTCAGATG AATGGGACTG TGA GAA CCA TCT GTG ACC MiPi TTG ATA CAG 50 



Glu Pro Ser Val Thr Lys Leu lie Gin 



1 



5 



GAA CAG GAG 7^ GAG CCG CGG TGG CTG ACA TTA CAC TCC AAC TGG GAG 
Glu Gin Glu Lys Glu Pro Arg Trp Leu Thr Leu His Ser Asn Trp Glu 
10 15 20 25 



98 
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AGC CTC AAT GGG ACC ACT TTA CAT GAA CTT GTA GTA AAT GGG CAG TCT 146 
Ser Leu Asn Gly Thr Thr Leu His Glu Leu Val Val Asn Gly Gin Ser 
30 35 40 

TGT GAG AGC AGA AGT AAA ATT TCT CTT CTG TGT ACT AAA CAA GAC TGT 194 
Cys Glu Ser Arg Ser Lys lie Ser Leu Leu Cys Thr Lys Gin Asp Cys 
45 50 55 

GGG CGC CGC CCT GCT GCC CGA ATG AAC AAA AGG ATC CTT GGA GGT CGG 242 
Gly Arg Arg Pro Ala Ala Arg Met Asn Lys Arg lie Leu Gly Gly Arg 
60 65 70 

ACG AGT CGC CCT GGA AGG TGG CCA TGG Cr\G TGT TCT CTG CAG AGT GAA 2 90 

Thr Ser Arg Pro Gly Arg Trp Pro Trp Gin Cys Ser Leu Gin Ser Glu 
75 80 85 

CCC AGT GGA CAT ATC TGT GGC TGT GTC CTC ATT GCC AAG AAG TGG GTT 338 
Pro Ser Gly His lie Cys Gly Cys Val Leu lie Ala Lys Lys Trp Val 
90 95 100 105 

GTG ACA GTT GCC CAC TGC TTC GAG GGG AGA GAG AAT GCT GCA GTT TGG 386 
Val Thr Val Ala His Cys Phe Glu Gly Arg Glu Asn Ala Ala Val Trp 
110 115 120 

AAA GTG GTG CTT GGC ATC AAC AAT CTA GAC CAT CCA TCA GTG TTC ATG 434 
Lys Val Val Leu Gly lie Asn Asn Leu Asp His Pro Ser Val Phe Met 
125 130 135 

CAG ACA CGC TTT GTG AGG ACC ATC ATC CTG CAT CCC CGC TAC AGT CGA 482 
Gin Thr Arg Phe Val Arg Thr lie lie Leu His Pro Arg Tyr Ser Arg 
140 145 150 

GCA GTG GTG GAC TAT GAC ATC AGC ATC GTT GAG CTG AGT GAA GAC ATC 53 0 

Ala Val Val Asp Tyr Asp He Ser He Val Glu Leu Ser Glu Asp He 
155 160 165 

AGT GAG ACT GGC TAC GTC CGG CCT GTC TGC TTG CCC AAC CCG GAG CAG 578 
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Ser Glu Thr Gly Tyr Val Arg Pro Val Cys Leu Pro Asn Pro Glu Gin 
170 175 180 185 

TGG CTA GAG CCT GAC ACG TAG TGC TAT ATC ACA GGC TGG GGC CAC ATG 62 6 

Trp Leu Glu Pro Asp Thr Tyr Cys Tyr lie Thr Gly Trp Gly His Met 
190 195 200 

GGC AAT AAA ATG CCA TTT AAG CTG CAA GAG GGA GAG GTC CGC ATT ATT 674 
Gly Asn Lys Met Pro Phe Lys Leu Gin Glu Gly Glu Val Arg lie lie 
205 210 215 

TCT CTG GAA CAT TGT CAG TCC TAC TTT GAC ATG AAG ACC ATC ACC ACT 722 
. Ser Leu Glu His Cys Gin Ser Tyr Phe Asp Met Lys Thr lie Thr Thr 
220 225 230 

CGG ATG ATA TGT GCT GGC TAT GAG TCT GGC ACA GTT GAT TCA TGC ATG 770 
Arg Met lie Cys Ala Gly Tyr Glu Ser Gly Thr Val Asp Ser Cys Met 
235 240 245 

GGT GAC TGG GGC GGT CCG TTG AAT TCT GT 799 
Gly Asp Trp Gly Gly Pro Leu Asn Ser 
250 255 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Glu Pro Ser Val Thr Lys Leu lie Gin Glu Gin Glu Lys Glu Pro Arg 
15 10 15 



P:\OPER\EIH\NOVELM.PRV- 13/2/97 



46- 



Trp Leu Thr Leu His Ser Asn Trp Glu Ser Leu Asn Gly Thr Thr Leu 
20 25 30 

His Glu Leu Val Val Asn Gly Gin Ser Cys Glu Ser Arg Ser Lys lie 
35 40 45 

Ser Leu Leu Cys Thr Lys Gin Asp Cys Gly Arg Arg Pro Ala Ala Arg 
50 55 60 

Met Asn Lys Arg lie Leu Gly Gly Arg Thr Ser Arg Pro Gly Arg Trp 
65 70 75 80 

Pro Trp Gin Cys Ser Leu Gin Ser Glu Pro Ser Gly His lie Cys Gly 
85 90 95 

Cys Val Leu lie Ala Lys Lys Trp Val Val Thr Val Ala His Cys Phe 
100 105 110 

Glu Gly Arg Glu Asn Ala Ala Val Trp Lys Val Val Leu Gly lie Asn 
115 120 125 

Asn Leu Asp His Pro Ser Val Phe Met Gin Thr Arg Phe Val Arg Thr 
130 135 140 

lie lie Leu His Pro Arg Tyr Ser Arg Ala Val Val Asp Tyr Asp lie 
145 150 155 160 

Ser lie Val Glu Leu Ser Glu Asp lie Ser Glu Thr Gly Tyr Val Arg' 
165 170 175 

Pro Val Cys Leu Pro Asn Pro Glu Gin Trp Leu Glu Pro Asp Thr Tyr 
180 185 190 

Cys Tyr lie Thr Gly Trp Gly His Met Gly Asn Lys Met Pro Phe Lys 
195 200 205 



Leu Gin Glu Gly Glu Val Arg lie He Ser Leu Glu His Cys Gin Ser 
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210 215 220 

Tyr Phe Asp Met Lys Thr lie Thr Thr Arg Met lie Cys Ala Gly Tyr 
225 230 235 240 

Glu Ser Gly Thr Val Asp Ser Cys Met Gly Asp Trp Gly Gly Pro Leu 
245 250 255 

Asn Ser 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 166.. 1773 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATTTAATACG ACTCACTATA GGGAATTTGG CCCTCGAGGA AGAATTCGGC ACGAGGCTGC 6 0 

GGCGCACTGT GAGGGAGTCG CTGTGATCCG GGGCCCCGAA CCCGACTGGA GCTGAAGCGC 120 

AGGCTGCGGG GCGCGGAGTC GGGAGGCCTG AGTGTTCCTT CCAGC ATG TCG GAG 174 

Met Ser Glu 
1 
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GGG GAG TCC CAG ACA GTA CTT AGC AGT GGC TCA GAG CCA AAG GTA GAA 222 
Gly Glu Ser Gin Thr Val Leu Ser Ser Gly Ser Asp Pro Lys Val Glu 
5 10 15 

TCT TCA TCT TCA GCT CCT GGC CTG ACA TCA GTG TCA CCT CCT GTG ACC 270 
Ser Ser Ser Ser Ala Pro Gly Leu Thr Ser Val Ser Pro Pro Val Thr 
20 25 30 35 

TCC ACA ACC TCA GCT GCT TCC CCA GAG GAA GAA GAA GAA AGT GAA GAT 318 
Ser Thr Thr Ser Ala Ala Ser Pro Glu Glu Glu Glu Glu Ser Glu Asp 
40 45 50 

GAG TCT GAG ATT TTG GAA GAG TCG CCC TGI GGG CGC TGG CAG AAG AGG 366 
Glu Ser Glu lie Leu Glu Glu Ser Pro Cys Gly Arg Trp Gin Lys Arg 
55 60 65 

CGA GAA GAG GTG AAT CAA CGG AAT GTA CCA GGT ATT GAC AGT GCA TAC 414 
Arg Glu Glu Val Asn Gin Arg Asn Val Pro Gly lie Asp Ser Ala Tyr 
70 75 80 

CTG GCC ATG GAT ACA GAG GAA GGT GTA GAG GTT GTG TGG AAT GAG GTA 462 
Leu Ala Met Asp Thr Glu Glu Gly Val Glu Val Val Trp Asn Glu Val 
85 90 95 

CAG TTC TCT GAA CGC AAG AAC TAC AAG CTG CAG GAG GAA AAG GTT TGT 510 
Gin Phe Ser Glu Arg Lys Asn Tyr Lys Leu Gin Glu Glu Lys Val Cys 
100 105 110 115 

GCT GTG TTT GAT AAT TTG ATT CAA TTG GAG CAT CTT AAC ATT GTT AAG "558 
Ala Val Phe Asp Asn Leu lie Gin Leu Glu His Leu Asn lie Val Lys 
120 125 130 

TTT CAC AAA TAT TGG GCT GAC ATT AAA GAG AAC AAG GCC AGG GTC ATT 6 06 

Phe His Lys Tyr Trp Ala Asp lie Lys Glu Asn Lys Ala Arg Val lie 
135 140 145 



TTT ATC ACA GGA TAC ATG TCA TCT GGG AGT CTG AAG CAA TTT CTG AAG 



654 
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Phe lie Thr Gly Tyr Met Ser Ser Gly Ser Leu Lys Gin Phe Leu Lys 
150 ^ 155 160 

AAG ACC CAA AAG AAC CAC CAG ACG ATG AAT GAA AAG GCA TGG AAG CGT 7 02 

Lys Thr Gin Lys Asn His Gin Thr Met Asn Glu Lys Ala Trp Lys Arg 
165 170 175 

TGG TGC ACA CAA ATC CTC TCT GCC CTA AGC TAG CTG CAC TCC TGT GAC 750 
Trp Cys Thr Gin lie Leu Ser Ala Leu Ser Tyr Leu His Ser Cys Asp 
180 185 190 195 

CCC CCC ATC ATC CAT GGG AAC CTG ACC TGT GAC ACC ATC TTC ATC CAG 798 
Pro Pro lie lie His Gly Asn Leu Thr Cys Asp Thr lie Phe lie Gin 
200 205 210 

CAC AAC GGA CTC ATC AAG ATT GGC TCT GTG GCT CCT GAC ACT ATC AAC 84 6 

His Asn Gly Leu lie Lys lie Gly Ser Val Ala Pro Asp Thr lie Asn 
215 220 225 

AAT CAT GTG AAG ACT TGT CGA GAA GAG CAG AAG AAT CTA CAC TTC TTT 894 
Asn His Val Lys Thr Cys Arg Glu Glu Gin Lys Asn Leu His Phe Phe 
230 235 240 

GCA CCA GAG TAT GGA GAA GTC ACT AAT GTG ACA ACA GCA GTG GAC ATC 942 
Ala Pro Glu Tyr Gly Glu Val Thr Asn Val Thr Thr Ala Val Asp lie 
245 250 255 

TAC TCC TTT GGC ATG TGT GCA CTG GGG ATG GCA GTG CTG GAG ATT CAG 990 
Tyr Ser Phe Gly Met Cys Ala Leu Gly Met Ala Val Leu Glu lie Gin 
260 265 270 275 

GGC AAT GGA GAG TCC TCA TAT GTG CCA CAG GAA GCC ATC AGC AGT GCC 1038 
Gly Asn Gly Glu Ser Ser Tyr Val Pro Gin Glu Ala lie Ser Ser Ala 
280 285 290 

ATC CAG CTT CTA GAA GAC CCA TTA CAG AGG GAG TTC ATT CAA AAG TGC 1086 
lie Gin Leu Leu Glu Asp Pro Leu Gin Arg Glu Phe lie Gin Lys Cys 
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295 300 305 

CTG CAG TCT GAG CCT GCT CGC AGA CCA ACA GCC AGA GAA CTT CTG TTC 1134 
Leu Gin Ser Glu Pro Ala Arg Arg Pro Thr Ala Arg Glu Leu Leu Phe 
310 315 320 

CAC CCA GCA TTG TTT GAA GTG CCC TCG CTC AAA CTC CTT GCG GCC CAC 1182 
His Pro Ala Leu Phe Glu Val Pro Ser Leu Lys Leu Leu Ala Ala His 
325 330 335 

TGC ATT GTG GGA CAC CAA CAC ATG ATC CCA GAG AAC GCT CTA GAG GAG 123 0 

Cys lie Val Gly His Gin His Met He Pro Glu Asn Ala Leu Glu Glu 
340 345 350 355 

ATC ACC AAA AAC ATG GAT ACT AGT GCC GTA CTG GCT GAA ATC CCT GCA 1278 
He Thr Lys Asn Met Asp Thr Ser Ala Val Leu Ala Glu He Pro Ala 
360 365 370 

GGA CCA GGA AGA GAA CCA GTT CAG ACT TTG. TAG TCT CAG TCA CCA GCT 1326 
Gly Pro Gly Arg Glu Pro Val Gin Thr Leu Tyr Ser Gin Ser Pro Ala 
375 380 385 

CTG GAA TTA GAT AAA TTC CTT GAA GAT GTC AGG AAT GGG ATC TAT CCT 1374 
Leu Glu Leu Asp Lys Phe Leu Glu Asp Val Arg Asn Gly He Tyr Pro 
390 395 400 

CTG ACA GCC TTT GGG CTG CCT CGG CCC CAG CAG CCA CAG CAG GAG GAG 1422 
Leu Thr Ala Phe Gly Leu Pro Arg Pro Gin Gin Pro Gin Gin Glu Glu 
405 410 415 

GTG ACA TCA CCT GTC GTG CCC CCC TCT GTC AAG ACT CCG ACA CCT GAA 1470 
Val Thr Ser Pro Val Val Pro Pro Ser Val Lys Thr Pro Thr Pro Glu 
420 425 430 435 



CCA GCT GAG GTG GAG ACT CGC AAG GTG GTG CTG ATG CAG TGC 7VAC ATT 
Pro Ala Glu Val Glu Thr Arg Lys Val Val Leu Met Gin Cys Asn He 
440 445 450 



1518 



P:\OPER\EJH\NOVELM.PRV - 13/2/57 



-51 



GAG TCG GTG GAG GAG GGA GTC AAA CAC CAC CTG ACA CTT CTG CTG AAG 1566 
Glu Ser Val Glu Glu Gly Val Lys His His Leu Thr Leu Leu Leu Lys 
455 460 465 

TTG GAG GAG AAA CTG AAC CGG CAC CTG AGC TGT GAC CTG ATG CCA AAT 1614 
Leu Glu Asp Lys Leu Asn Arg His Leu Ser Cys Asp Leu Met Pro Asn 
470 475 480 

GAG AAT ATC CCC GAG TTG GCG GCT GAG CTG GTG CAG CTG GGC TTC ATT 16 62 

Glu Asn lie Pro Glu Leu Ala Ala Glu Leu Val Gin Leu Gly Phe lie 
485 490 495 

AGT GAG GCT GAC CAG AGC CGG TTG ACT TCT CTG CTA GAA GAG ACC TTG 1710 
Ser Glu Ala Asp Gin Ser Arg Leu Thr Ser Leu Leu Glu Glu Thr Leu 
500 505 510 515 

AAC AAG TTC AAT TTT GCC AGG AAC AGT ACC CTC AAC TCA GCC GCT GTC 1758 
Asn Lys Phe Asn Phe Ala Arg Asn Ser Thr Leu Asn Ser Ala Ala Val 
520 525 530 

ACC GTC TCC TCT TAGAGCTCAC TCGGGCCAGG CCCTGATCTG CGCTGTGGCT 1810 
Thr Val Ser Ser 
535 

GTCCCTGGAC GTGCTGCAGC CCTCCTGTCC CTTCCCCCCA GTCAGTATTA CCCTGTGAAG 1870 

CCCCTTCCCT CCTTTATTAT TCAGGAGGGC TGGGGGGGCT CCCTGGTTCT GAGCATCATC 193 0 

CTTTCCCCTC CCCTCTCTTC CTCCCCTCTG CACTTTGTTT ACTTGTTTTG CACAGACGTG 19 90 

GGCCTGGGCC TTCTCAGCAG CCGCCTTCTA GTTGGGGGCT AGTCGCTGAT CTGCCGGCTC 2050 

CCGCCCAGCC TGTGTGGAAA GGAGGCCCAC GGGCACTAGG GGAGCCGAAT TCTACAATCC 2110 

CGCTGGGGCG GCCGGGGCGG GAGAGAAAGG TGGTGCTGCA GTGGTGGCCC TGGGGGGCCA 2170 

TTCGATTCGC CTCAGTTGCT GCTGTAATAA AAGTCTACTT TTTGCTAAAA AAAAAAAAAA 223 0 
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AAAAAAAAAA A 2241 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 535 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEC ID NO:10: 

Met Ser Glu Gly Glu Ser Gin Thr Val Leu Ser Ser Gly Ser Asp Pro 
15 10 15 

Lys Val Glu Ser Ser Ser Ser Ala Pro Gly Leu Thr Ser Val Ser Pro 
20 25 30 

Pro Val Thr Ser Thr Thr Ser Ala Ala Ser Pro Glu Glu Glu Glu Glu 
35 40 45 

Ser Glu Asp Glu Ser Glu lie Leu Glu Glu Ser Pro Cys Gly Arg Trp 
50 55 60 

Gin Lys Arg Arg Glu Glu Val Asn Gin Arg Asn Val Pro Gly lie Asp 
65 70 75 80 

Ser Ala Tyr Leu Ala Met Asp Thr Glu Glu Gly Val Glu Val Val Trp 
85 90 95 

Asn Glu Val Gin Phe Ser Glu Arg Lys Asn Tyr Lys Leu Gin Glu Glu 
100 105 110 



Lys Val Cys Ala Val Phe Asp Asn Leu lie Gin Leu Glu His Leu Asn 
115 120 125 
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He Val Lys Phe His Lys Tyr Trp Ala Asp He Lys Glu Asn Lys Ala 
130 135 140 

Arg Val He Phe He Thr Gly Tyr Met Ser Ser Gly Ser Leu Lys Gin 
145 150 155 160 

Phe Leu Lys Lys Thr Gin Lys Asn His Gin Thr Met Asn Glu Lys Ala 
165 170 175 

Trp Lys Arg Trp Cys Thr Gin He Leu Ser Ala Leu Ser Tyr Leu His 
180 185 190 

Ser Cys Asp Pro Pro He He His Gly Asn Leu Thr Cys Asp Thr He 
195 200 205 

Phe He Gin His Asn Gly Leu He Lys He Gly Ser Val Ala Pro Asp 
210 215 220 

Thr He Asn Asn His Val Lys Thr Cys Arg Glu Glu Gin Lys Asn Leu 
225 230 235 240 

His Phe Phe Ala Pro Glu Tyr Gly Glu Val Thr Asn Val Thr Thr Ala 
245 250 255 

Val Asp He Tyr Ser Phe Gly Met Cys Ala Leu Gly Met Ala Val Leu 
260 265 270 

Glu He Gin Gly Asn Gly Glu Ser Ser Tyr Val Pro G3 n Glu Ala He 
275 280 285 

Ser Ser Ala He Gin Leu Leu Glu Asp Pro Leu Gin Arg Glu Phe He 
290 295 300 

Gin Lys Cys Leu Gin Ser Glu Pro Ala Arg Arg Pro Thr Ala Arg Glu 
305 310 315 320 



Leu Leu Phe His Pro Ala Leu Phe Glu Val Pro Ser Leu Lys Leu Leu 
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325 330 335 

Ala Ala His Cys Xle Val Gly His Gin His Met lie Pro Glu Asn Ala 
340 345 350 

Leu Glu Glu lie Thr Lys Asn Met Asp Thr Ser Ala Val Leu Ala Glu 
355 360 365 

lie Pro Ala Gly Pro Gly Arg Glu Pro Val Gin Thr Leu Tyr Ser Gin 
370 375 380 

Ser Pro Ala Leu Glu Leu Asp Lys Phe Leu Glu Asp Val Arg Asn Gly 
385 390 395 400 

lie Tyr Pro Leu Thr Ala Phe Gly Leu Pro Arg Pro Gin Gin Pro Gin 
405 410 415 

Gin Glu Glu Val Thr Ser Pro Val Val Pro Pro Ser Val Lys Thr Pro 
420 425 430 

Thr Pro Glu Pro Ala Glu Val Glu Thr Arg Lys Val Val Leu Met Gin 
435 440 445 

Cys Asn lie Glu Ser Val Glu Glu Gly Val Lys His His Leu Thr Leu 
450 455 460 

Leu Leu Lys Leu Glu Asp Lys Leu Asn Arg His Leu Ser Cys Asp Leu 
465 470 475 480 

Met Pro Asn Glu Asn lie Pro Glu Leu Ala Ala Glu Leu Val Gin Leu 
485 490 495 

Gly Phe lie Ser Glu Ala Asp Gin Ser Arg Leu Thr Ser Leu Leu Glu 
500 505 510 

Glu Thr Leu Asn Lys Phe Asn Phe Ala Arg Asn Ser Thr Leu Asn Ser 
515 520 525 
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Ala Ala Val Thr Val Ser Ser 
530 535 

DATED this 13th day of February 1996 

THE COUNCIL OF THE QUEEN S LAND IN S TITUTE OF MEDICA L 
RESEARCH PVccs <vo..<^ O^co^o^Ce. «^ ^ 

By DAVIES COLLISON CAVE 
Patent Attorneys for the Applicant(s) 
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